The SGI Origin: A CCNIJMA Highly Scalable Server
نویسنده
چکیده
The SGI Origin 2000 is a cache-coherent non-uniform memory access IccNUMA) tnultionxessor desipned and manufactured bv Silicon ‘Graphics,. Inc. ?he Origin sys;m was desi&d from th> ground up as a multiprocessor capable of scaling to both small and large processor counts without any bandwidth, latency, or cost cliffs.The Origin system consists of up to 512 nodes interconnected by a scalable Craylink network Each node consists of one or two RIO000 processors, up to 4 GB of coherent memory, and a connection to a portion of the XI0 IO subsystem. This paper discusses the motivation for building the Origin 2000 and then describes its architecture and implementation. In addition, performance results are presented for the NAS Parallel Benchmarks V2.2 and the SPLASH2 applications. Finally, the Origin system is compared to other contemporary commercial ccNUMA systems.
منابع مشابه
Large Scale Simulation of Particulate Flows
Simulations of particles in fluid flows are of great interest to numerous industries using sedimentation, fluidization, lubricated transport, and hydraulic fracturing of hydrocarbon reservoirs. Simulating incompressible viscoelastic flows with millions of rigid particles is computationally a very challenging problem. In addition to using sophisticated modeling techniques and numerical algorithm...
متن کاملEvaluting Performance of OpenMP and MPI on the SGI Origin 2000 with Benchmarks of Realistic Problem Sizes
Six application benchmarks, including four numerical aerodynamic simulation (NAS) codes, provided by H. Jin and J. Wu, were previously parallelized using OpenMP and message-passing interface (MPI) and run on a 128-processor Silicon Graphics Inc. (SGI) Origin 2000. Detailed profile data were collected to understand the factors causing imperfect scalability. The results show that load imbalance a...
متن کاملRefreshment Policies for Web Content Caches
Web content caches are often placed between end-users and origin servers as a mean to reduce server load, network usage, and ultimately, user-perceived latency. Cached objects typically have associated expiration times, after which they are considered stale and must be validated with a remote server (origin or another cache) before they can be sent to a client. A considerable fraction of cache ...
متن کاملModeling and Simulations of Complex Low- Dimensional systems: Testing the Efficiency of Parallelization
The deterministic quantum transfer-matrix (QTM) technique and its mathematical background are presented. This important tool in computational physics can be applied to a class of the real physical low-dimensional magnetic systems described by the Heisenberg hamiltonian which includes the macroscopic molecularbased spin chains, small size magnetic clusters embedded in some supramolecules and oth...
متن کاملParallel Maximum-Likelihood Inversion for Estimating Wavenumber-Ordered Spectra in Emission Spectroscopy
We introduce a parallelization of the maximumlikelihood cosine transform. This transform consists of a computationally intensive iterative fitting process, but is readily decomposed for parallel processing. The parallel implementation is not only scalable, but has also brought the execution time of this previously intractable problem to feasible levels using contemporary and cost-efficient high...
متن کامل